# Reinforcement Learning Inference Optimization
Acereason Nemotron 14B
Other
AceReason-Nemotron-14B is a math and code reasoning model trained through reinforcement learning, based on DeepSeek-R1-Distilled-Qwen-14B, excelling in math and code reasoning tasks.
Large Language Model
Transformers

A
nvidia
7,863
70
Open RS1
MIT
A small-scale large language model enhanced by reinforcement learning, focused on improving the reasoning capabilities of a 1.5B parameter model
Large Language Model
Transformers

O
knoveleng
6,229
4
Featured Recommended AI Models